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Abstract: The dynamics of HIV-1 viremia is a complex and evolving landscape with 
clinical and epidemiological (public health) implications. Most studies have relied on the 
use of set-point viral load (VL) as a readily available proxy of viral dynamics to assess host 
and viral correlates. This review highlights recent findings from population-based studies of 
set-point VL, focusing primarily on robust data related to host genetics. A comprehensive 
understanding of viral dynamics will clearly need to consider both host and viral 
characteristics, with close attention to (i) the timing of VL measurements, (ii) the biology of 
viral evolution, (iii) compartments of active viral replication, (iv) the transmission source 
partner as the immediate past microenvironment, and (v) proper application of statistical 
models. 
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1. Introduction 

HIV-1 infection typically occurs through a single viral variant [1-5], but the initial viral 
homogeneity is rather transient as the surviving viruses must mutate to evade host immune defenses or 
to regain fitness lost during adaptation to the immediate past host (the transmission source partner) [6]. 
At the population level, HIV-1 subtypes responsible for the global AIDS pandemic can vary by 
geographic region [7,8], while frequent superinfection can generate mosaic viruses (circulating 
recombinant forms) to promote viral diversity [8]. Understanding the evolution of HIV-1 -host 
interactions requires close attention to both viral and host (immunologic) dynamics [9]. 

HIV-1 viral load (VL) set-point is a well-studied phenotype tied to virus-host equilibrium, with 
high set-point VLs translating to rapid disease progression [10-17] and fast transmission to susceptible 
hosts [18, 19]. In many individuals, the viral 'set-point' is reached within weeks of infection [12, 20, 
21], and it can remain relatively steady (±0.5 logio RNA copies/ml) for years during clinical latency 
[10]. Progression to AIDS is usually accompanied by (i) rising VL, (ii) substantial loss of CD4+ T- 
cells in peripheral blood, and (hi) risk for opportunistic infections and malignancies. AIDS diagnosis 
based on < 200 CD4 cells/mm 3 of blood and at least one opportunistic infection [22,23] can serve as 
another important phenotype for measuring the dynamics of host-virus interactions, but it can take 
close to a decade to develop even during untreated HIV-1 infection. In the era of highly active 
antiretroviral therapy (HAART), AIDS diagnosis is increasingly rare, so a focus on studying set-point 
VL as a proxy of viral fitness under specific microenvironment in the host is well justified, especially 
since many clinical decisions must be made during the early stages of HIV-1 infection [9, 24]. 

Figure 1. Selection of recent (post-2010) publications for systematic review. 

Two rounds of searches in PubMed yield 2,660 original research articles that 
contain three key words (HIV, viral load and host or viral genome). Only 22 of 
these meet the criteria for full evaluation here (20 in Table 1 and two in Table 2). 

Articles printed in English journals since January 2010 (N = 2,660) 

Set-point VL as the phenotype of interest (n = 50) 
V 

Adult populations only (n = 47 remaining) 
V 

Sample size greater than 100 (n = 25 excluded) 
v 

Treatment-naive patients without AIDS-related co-infections 



(n = 22, final selection) 
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HIV-1 VL was, in one way or another, a subject in over 2,600 articles published since January 
2010 (Figure 1). Our review here intends to highlight recent population-based research on host and 
viral correlates of HIV-1 VL set-point or its equivalent. For clarity and fair comparisons, studies 
assessing the relationship between host and/or viral factors on early set-point VL were selected 
according to two phenotypes, i.e.; set-point and chronic VLs as continuous or categorical endpoints. In 
addition, it was necessary to exclude studies dealing with children or youth (rare) or with small sample 
sizes (<100 HAART-nai've subjects). In the end, a total of 22 original research articles remained after 
four rounds of selection (Figure 1). Interpretation of these recent studies is relatively straightforward 
when supporting evidence from earlier reports is available. 

2. Host Genetics and Set-Point VL 

2.1 Human Leukocyte Antigen (HLA) Class I and Class II Genes as Prominent Factors 

HLA molecules mediate immune responses through multiple mechanisms, and their importance to 
effective immune control of HIV-1 infection has been well publicized in the past two decades 
[9,25-27]. Polymorphisms around the peptide-binding groove of HLA class I (HLA-I) and HLA-II 
molecules determine the specificity of cytotoxic T-lymphocyte responses (CTLs) and T-helper cell 
epitopes, respectively [28]. Direct interactions between HLA-I and killer cell immunoglobulin-like 
receptors (KIRs) can dictate natural killer (NK) cell function [29], which is further regulated by HLA 
leader peptides loaded to HLA-E [30,31]. These intertwined properties essential to both innate and 
adaptive immunity inevitably complicate the analysis of individual HLA alleles and certain 
functionally relevant residues or motifs shared by different alleles. 

When individual HLA-I alleles are compared, new findings (Table 1) continue to support the notion 
that HLA-A and HLA-C alleles are less prominent than HLA-B alleles [32-36]. Specifically, studies 
have readily recognized HLA-B*13, B*14, B*18, B*27, B*35, B*44, B*45, B*53, B*57, B*58:01, 
B*58:02 and B*81 as distinct correlates of HIV-1 VL in several cohorts from Africa and North 
America [32-36]. Evidence for three HLA-A alleles (A*32, A*36, and A*74), two HLA-C alleles 
(C*08 and C*18), and one combination (HLA-A*30+HLA-C*03) is rather consistent with earlier 
observations, with HLA-A*74 being favorable (low VL) in native Africans and African-Americans 
[34, 35, 37-39]. Linkage disequilibrium (LD) between HLA-A*74 and HLA-B*57 may obscure the 
analysis of the former, but an independent contribution by A*74 was evident in a large sample size 
[39]. HLA-C* 18 as a favorable allele needs further assessment, as it apparently tags two favorable 
HLA-B alleles, B*57:03 and B*81:01 [34, 40]. The HLA-C* 12— HLA-B* 39 haplotype is another 
example of neighboring alleles that are hard to separate [33, 35]. 

For HLA-II (Table 1), only two alleles have shown appreciable impact on set-point VL: HLA- 
DRB 1*01:02 and HLA-DRB 1*1 3:03 are associated with relatively high and low VL, respectively 
[34, 41]. Of note, HLA-DRB 1*0 1:02 was associated with high VL in a combined cohort of 
seroconverting patients (SCs) and seroprevalent patients (SPs) or in SPs alone [34]. In theory, SCs are 
more suitable for association analyses as few viral mutations are seen in early infection when set-point 
VL is measured. The relatively late effect of HLA-DRB 1*0 1:02 (if confirmed) may reflect the delayed 
onset of high-affinity antibody responses mediated by HLA-II products. On the other hand, HLA- 
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DRB 1*13:03 is in moderate LD with HLA-B*57, but its association with low VL remained clear 
even when patients with HLA-B*57 were excluded [41]. 

When the mature HLA-B protein forms are inferred from HLA-B genotyping results, three amino 
acid residues at positions 67, 70, and 97 (Met 67 , Ser 70 and Val 97 around the C and F pockets) seem to 
explain alleles (e.g.; B*57) associated with favorable outcomes (HIV-1 control) [42]. 



Table 1. Host Genetic Factors That Are Positively or Negatively Associated with HIV-1 
Viral Load (VL) Set-Point or Assumed Set-point, as Reported in Recent Studies. 3 



Gene or gene cluster b 


Allele or haplotype c 


Ethnicity d 


Impact on VL 


Refs 




A*32 


AA 


Favorable 


[351 

1 - ^ 1 




A*36 


African 


Unfavorable 


[34, 35] 




A*74 


AA, African 


Favorable 


[33-35, 39] 




B*13 


African 


Favorable 


[341 

L "J 




B*14 


AA 


Favorable 


[351 

1 - ^ 1 




B*18 


African 


Unfavorable 


[331 

1 - 1 




B*27 


Caucasian 


Favorable 


[321 

I I 




B*35 


Caucasian 


Unfavorable 


[32, 35] 


Classical HLA class I genes: 


B*44 


African 


Favorable 


[361 

L 1 


HLA-A, 


B*45 


AA, African 


Unfavorable 


[34, 351 


HLA-B, 


B*53 


AA 


Unfavorable 


[35] 


and HLA-C 


B*57 


AA, African, 
Caucasian 


Favorable 


[32-36, 39, 43] 




B*58:01 


African 


Favorable 


[33, 43] 




B*58:02 


African 


Unfavorable 


[33, 34] 




B*81 


African 


Favorable 


[34] 




C*08 


African 


Favorable 


[35] 




C*18 


African 


Favorable 


[34, 35] 




A*30+C*03 


African 


Favorable 


[34] 




C*04:01-B*81:01 


African 


Favorable 


[33] 




C*12-B*39 


African 


Favorable 


[33, 35] 




Homozygosity 


AA and African 


Unfavorable 


[33, 35] 


HLA-DRBl 


DRB1*01:02 


African 


Unfavorable 


[34] 




DRB1*13:03 


African 


Favorable 


[41] 


Killer cell immunoglobulin-like 


KIR3DS1 copy no. 


Caucasian 


Favorable if >1 copy 


[44] 


receptor (KIR) genes 


KIR3DL1 copy no. 


Caucasian 


Favorable if >1 copy 


[44] 


CCR5 


A32 heterozygosity 


Caucasian 


Favorable 


[45] 


CCR2-CCR5 


HHD/HHE 


African 


Unfavorable 


[46] 




HHF*2 homozygosity 


African 


Favorable 


[46] 


ecu 


rs5029410 allele C 


African 


Favorable 


[47] 


DC-SIGNR (CD209L) 


7 or 9 repeats of a 69- 
bp coding sequence 


Asian (Chinese) 


Unfavorable 


[48] 


Miscellaneous loci (sporadic 


rs2395029, allele C 


Caucasian 


Favorable 


[45, 49] 


SNPs) 


rs9264942, allele G 


Caucasian 


Favorable 


[45, 49] 



a Four studies [50-53] with mostly negative results (not reaching statistical significance) are cited briefly in the text. 
b Organized by group and sorted by degree of popularity i.e.; the number of studies meeting criteria (see Figure 1). 
c Variants in bold have shown consistency between studies conducted by different investigators. Certain amino acid 
residues may account for HLA-B allelic effects (e.g.; B*57 and B*81) [42, 54], as discussed in the text. 
d AA= African American. 
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In African Americans, nonsynonymous single nucleotide polymorphism (SNPs) corresponding 
to HLA-B amino acid positions 63, 97, and 116 account for much of the effects attributable to the 
HLA-B locus [54]. However, HLA-B *44 alleles (Ser 67 , Asn 70 and Arg 97 ) that are favorable in native 
Africans did not conform to this newly recognized rule [36]. Similarity or difference in peptide- 
binding preferences alone may not fully capture the spectrum of concerted and evolving immune 
function that is essential to durable containment of HIV-1 infection [55]. 

Specific alleles and codon positions aside, HLA-I homozygosity (lack of diversity) has what 
appears to be an additive effect on set-point VL [33,35] (Table 1), probably by allowing rapid viral 
immune escape. Homozygosity is mostly restricted to common HLA-I alleles found in a given 
population, so its disadvantage may alternatively imply the advantage of rare or infrequent alleles to 
which viral adaptation is less likely to occur [56]. This concept of allele frequency-dependent 
influences on HIV-1 pathogenesis deserves further evaluation [35,38]. 

2.2 Killer Cell Immunoglobulin-like Receptor (KIR) Genes 

KIR gene products are primarily expressed on natural killer (NK) cells to inhibit or activate 
cytotoxic activities, depending on the combination of receptor-ligand (HLA-B or HLA-C) pairing 
[53,57,58]. Just like their HLA ligands, KIR molecules are highly polymorphic in terms of gene 
content and allelic diversity. In the presence of HLA-B ligand Bw4-80I, the activating KIR3DS1 and 
inhibitory KIR3DL1 may delay HIV-1 disease progression (time to AIDS or death) [57,59]. The 
specific role of KIR-HLA interaction in the early course of HIV-1 infection is not obvious [60]. 

New evidence now suggests that KIR3DS1 copy number variation is worth noting (Table 1). When 
HLA-Bw4-80I is present, one or more copies of KIR3DS1 was associated with relatively low set- 
point VL even after statistical adjustments for other known factors in the KIR-HLA interaction 
pathway, including HLA-B* 57, B*27, and B*35Px [44]. Two other recent studies found no association 
between KIR3DL1, KIR3DS1, or KIR2DS4 and viral load [52,53]. Differences in methodology and 
KIR3DS1 population frequencies may account for the lack of immediate consensus. 

2.3 Chemokine Receptors and Ligand Genes 

Several chemokine receptors, especially CCR5 and CCR2, act as HIV-1 co-receptors that facilitate 
viral entry into target cells. Neighboring CCR2 and CCR5 gene variants (haplotypes and diplotypes) 
have well-known relationships to HIV-1 transmission (initiation of infection) [61], but their role in 
established infection is not persuasive [25,62]. Heterozygosity for the 32-base-pair deletion in the 
CCR5 gene open reading frame is of epidemiological importance to various populations [62-65], so is 
the amino acid substitution of valine to isoleucine at CCR2 residue 64 (64V/I). The CCR2-CCR5 
haplotypes tagged by CCR5-A32 (HHG*2) and CCR2-64I (HHF*2) may act in concert to influence 
set-point VL in populations of European ancestry [62], but that combination (HHF*2/HHG*2) is too 
rare in other racial groups to be a worthy topic. Further work on various genes encoding CCR5 ligands 
(MlP-la, MIP-ip, and CCL5/RANTES) often leads to inconsistent or conflicting observations [66]. 

Investigation of chemokine receptor and ligand genes is still active (Table 1). Translation of CCR5- 
A32 to low set-point VL has gained new supporting evidence [45]. Modest advantage was also seen 
with HHF*2 homozygosity [46]. The HHD/HHE diplotype commonly seen in cohorts of African 
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ancestry appeared to be unfavorable [46]. More recently, the minor allele C for SNP rs5029410 (in 
the CCL3 gene) has been associated with low set-point VL [47], with a low probability of false 
discovery from multiple testing. 

2.4 Other Miscellaneous Observations Based on Candidate Gene Approach 

One study has revealed that DC-SIGNR (CD209L) genotypes can be associated with HIV-1 VL: 
the alleles encoding 7-repeat and 9-repeat isoforms appear to be unfavorable [48] (Table 1). The 
number of 23-amino acid repeats in the DC-SIGNR protein ranges from three to nine [67], and the 
reported associations can be attributed to two isoform combinations, 5/7 and 7/9. Biologically, DC- 
SIGNR and DC-SIGN are transmembrane receptors on dendritic cells that help ferry HIV-1 virions to 
tissues enriched with CD4 + T-cells [68]. Earlier work has shown some evidence about a possible 
distinction between the seven- or nine-repeat isoforms and others [67]. 

2.5 Results From Genome-wide Association Studies (GWAS) 

GWAS provide a hypothesis-free approach to identifying genes or SNPs of epidemiological 
importance. Multiple GWAS have consistently pointed to two SNPs as markers of effective immune 
control during HIV-1 infection. The first, rs2395029, is mapped to the HCP5 pseudogene. The second, 
rs9264942, is located about 35-kb upstream of HLA-C [69-76]. In Caucasians, these SNPs effectively 
tag HLA-B*57:01 and a micro RNA target site polymorphism in HLA-C 3' untranslated region (UTR), 
respectively [69,77]. Other HLA-I alleles can be involved as well [71,72,78,79]. 

Variants defined by rs2395029 and rs9264942 are highlighted in two new studies [45, 49] (Table 1). 
Separate analysis of SCs and SPs is considered useful as the effect sizes for many individual SNPs can 
vary greatly between SCs and SPs [49]. Two other GWAS based on African- Americans and native 
Africans failed to identify any SNPs with genome-wide statistical significance [50,51]. In the 
African- American cohort, the top 10 SNPs of interest are all within the human major 
histocompatibility complex (MHC) [50]. The SNP (rs2523608) with the best association signal (Table 
1) is actually in LD with HLA-B*57:03 (a favorable allele). In analysis of native Africans [51], the 
number one SNP of interest (rsl31 1 1989) is beyond the MHC region (Table 1). 

3. Viral Genetics and HIV-1 Set-point VL 

3.1 HIV-1 Genotype 

Epidemiologists and virologists are acutely aware of the evidence that defective viruses might 
partially explain spontaneous HIV-1 control, as seen in the strings of patients infected by a single 
blood donor in Sydney, Australia [80,81]. The ability of such viruses to cause sexual transmission (an 
inefficient process) is unclear, but recent analyses of 134 native Africans with sexually transmitted 
primary HIV-1 infection [36] did reveal that acute-phase VL can be low (<2,000 copies/mL) in a 
small proportion (-6.7%) of SCs. Direct experimental evidence is still elusive as infectious viruses are 
hard to recover from these subjects. Conversely, however, SCs with set-point VL below 50 copies/mL 
can have measurable acute-phase VL (>10,000 copies/mL) [36]. Other investigators have also come 
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across rare cases where elite control was possible even when highly pathogenic viruses from 
clinical AIDS patients were transmitted [82]. 

Table 2. Viral Markers That Are Associated with HIV-1 Set-Point Viral Load 
(VL), as Reported in Recent Studies. 

Viral factor Measurement Impact on set-point VL Refs 

Transmission source TSP VL correlates with set- [83] 

partner (TSP) VL point VL in linked recipients 
Heritability 

Genetic distance on High heritability in set-point [84] 

phylogenetic tree VL, from one infection to 

the next 



Following and verifying HIV-1 transmission chains are not easily done, but the assessment of 
donor and recipient VL can be useful [85]. New results from analyses of linked transmission pairs 
(Table 2) support a modest linear relationship between donor VL (chronic) and recipient set-point VL 
[83]. In a second study, genetic distances between viral sequences correlate with differences in VL 
[84], suggesting that viral genotypes should be considered during the search for quantitative trait loci. 

3.2 Interaction of Host and Viral Genetic Factors 

To properly dissect out factors (host or viral) with the greatest influence on HIV-1 evolution and 
viral load, models will need to simultaneously consider host and viral dynamics [83,85,86]. Among 
three closely related HLA-B allelic products examined in this context [43], HLA-B*57:03 appears to 
target four p24 Gag epitopes (ISW9, KF11, TW10, and QW9), but HLA-B*57:02 and HLA-B*58:01 
only target three and two of them, respectively [43]. Conceivably, these allelic forms can impose 
differential selection pressure on the viral genome. In the end, the causal factors of viral fitness can lie 
in the host and in the transmitted virus. 

4. Methodological Challenges 

4.1 Variations in Calculation of Set-point VL 

Despite its wide use, there is still no standard method for determining HIV-1 set-point VL, with 
multiple methods having been used rather randomly [87]. When a single RNA measurement is treated 
as the set-point [16,71,88], the timing of such measurement can vary greatly: (i) the visit after the first 
seropositive visit, (ii) visit at least three months after the estimated date of infection (EDI), (iii) visit at 
least six months after the EDI. Others prefer to use data from several visits [12,89], in favor of 
methods that calculate the VL phenotype as the average or as the median of multiple VL points within 
specific intervals of infection [87]. Those with more advanced statistical skills simply test repeated VL 
measurements in mixed models [36,78], but asymmetry in data structure (total visits and visit 
intervals) can be an issue. Decision to exclude patients with insufficient data can be a sticky business. 
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4.2 Early Chronic Phase Versus Chronic Phase 

Viral adaptation to the host microenvironment, including protective immune responses, is a gradual 
process. VLs taken during the early and later course of infection can possess similar traits for very 
different reasons [24,34,49], so findings are not directly comparable when the duration of infection is 
unknown. As most studies have already missed the early course of infection [65], the literature is likely 
most relevant to chronic infection when opportunistic infections (OIs) may complicate the analysis 
[33-35]. The OIs can be disparate in exposure, tissue compartments, and geography, but they are rarely 
captured in analysis of HIV- 1 VL readouts. 

4.3 Changes in Set-point VL over Calendar Time 

Several studies have noted an increase in set-point VL over time [45,90-92], while others disagree 
[93-95]. A large meta-analysis pooling results from prior studies of sero incident patients found a trend 
for a rising VL set-point over time [96]. Assuming that widespread viral adaptation does occur 
[45,96,97], one can envision that the timing of the AIDS epidemic in different populations can be 
critical. In an European population, pre-2003 set-point VL appeared to differ from post-2003 VL in 
SCs [91], accompanied by a loss of host genetic advantage conferred by CCR5-A32 and other 
prominent factors (e.g.; rs2395029/5*57:07) [45]. Likewise, patients with HLA-B*51 before and after 
2001 differed in their VLs [98], which is consistent with the hypothesis that specific CTL escape 
mutations induced by HLA-mediated immunity can reach fixation when these mutations have no 
apparent fitness costs [99]. Finding the tipping point for adapted versus unadapted viruses in each 
population is obviously another sticky business. 

4.4 Other Potential Confounders 

Cofactors not routinely considered in analysis of HIV- 1 VL can be quite obvious. For example, age 
and gender are known to influence VL [37], but they are infrequently seen in reported statistical 
models. Other less obvious factors can range from viral subtypes and its segregation with certain racial 
backgrounds [100,101] to differential distribution of important genetic variations (e.g.; CCR5-A32 and 
HLA-B*27) or the techniques used for defining them. Future studies will clearly need to apply 
multivariable models to carefully consider covariates and potential confounders. Composite scores 
based on all known factors may offer a temporary solution to simplifying the data analysis process 
[13,102], although individual factors may not have equally additive effects on HIV-1 VL. 

5. Conclusions 

HIV-1 viremia is an informative quantitative trait that varies at the individual and population levels. 
While many studies have attempted to sort out the quantitative trait loci, lack of clear consensus hints 
at various problems with study design and data analysis. 

Factors important to VL can lie in the host and viral genomes. As viral evolution shaped by host 
immune responses become more and more predictable, fine-mapping of viral and host genetics can 
begin to allow a fair assessment of primary and secondary factors for transformative research. In other 
words, an open-minded research question is not whether host factors predominate over viral factors or 
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vice versa, the two are so intertwined that their constant interactions in distinct individuals and 
populations collectively dictate the landscape of viral dynamics. The ultimate challenge (and goal) is to 
properly integrate comprehensive data on host and viral characteristics. The need for such approach is 
urgent, as datasets generated by high-throughput techniques will become overwhelmingly complex. 
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